Video highlights with user trimming

ABSTRACT

A server configured to receive video clips from a mobile device, such as eyewear. The server has an electronic processor enabled to execute computer instructions to process the video clips to identify one or more characteristics in the frames of the video clips. The processor selects the video clips having the identified characteristics in the frames and creates a set of the selected video clips having the identified characteristics in the frames. The processor allows a user of the mobile device to edit and trim the video clips having the identified characteristics to create trimmed video clip segments.

TECHNICAL FIELD

The present subject matter relates to an eyewear device, e.g., smartglasses.

BACKGROUND

Portable eyewear devices available today, such as smart glasses, headwear, and headgear, integrate cameras and see-through displays. These eyewear devices record video segments, which may be shared with others.

BRIEF DESCRIPTION OF THE DRAWINGS

The drawing figures depict one or more implementations, by way of example only, not by way of limitations. In the figures, like reference numerals refer to the same or similar elements.

FIG. 1A is a side view of an example hardware configuration of an eyewear device, which shows a right optical assembly with an image display, and field of view adjustments are applied to a user interface presented on the image display based on detected head or eye movement by a user;

FIG. 1B is a top cross-sectional view of a temple of the eyewear device of FIG. 1A depicting a visible light camera, a head movement tracker for tracking the head movement of the user of the eyewear device, and a circuit board;

FIG. 2A is a rear view of an example hardware configuration of an eyewear device, which includes an eye scanner on a frame, for use in a system for identifying a user of the eyewear device;

FIG. 2B is a rear view of an example hardware configuration of another eyewear device, which includes an eye scanner on a temple, for use in a system for identifying a user of the eyewear device;

FIGS. 2C and 2D are rear views of example hardware configurations of the eyewear device, including two different types of image displays;

FIG. 3 shows a rear perspective view of the eyewear device of FIG. 2A depicting an infrared emitter, an infrared camera, a frame front, a frame back, and a circuit board;

FIG. 4 is a cross-sectional view taken through the infrared emitter and the frame of the eyewear device of FIG. 3;

FIG. 5 illustrates detecting eye gaze direction;

FIG. 6 illustrates detecting eye position;

FIG. 7 depicts an example of visible light captured by the left visible light camera as a left raw image and visible light captured by the right visible light camera as a right raw image;

FIG. 8A illustrates a plurality of captured video clips;

FIG. 8B illustrates a plurality of trimmed video clip segments;

FIG. 8C illustrates video highlights generated from the trimmed video clip segments;

FIG. 9 illustrates a block diagram of electronic components of the eyewear device;

FIG. 10 is a flowchart of the operation of creating video highlights from trimmed video clip segments;

FIG. 11 is a flowchart of creating a summary of trimmed video clip segments;

FIG. 12 is a functional block diagram of a mobile device;

FIG. 13 is a flowchart of a mobile device receiving and processing video highlights with automated trimming of video clips; and

FIG. 14 is a flowchart of a mobile device receiving and processing video highlights with customized manual trimming of video clips.

DETAILED DESCRIPTION

In an example, video highlights (stories) are automatically generated at the end of each day. The system looks at each image taken during the day (or a subset thereof) and selects the best portions (highlights), e.g., based on similarity and time bucketing. This may be performed with a server configured to receive video clips from a mobile device, such as eyewear. The server has an electronic processor enabled to execute computer instructions to process the video clips to identify one or more characteristics in the frames of the video clips. The processor selects the video clips having the identified characteristics in the frames and creates a set of the selected video clips having the identified characteristics in the frames. The processor allows a user of the mobile device to edit and trim the video clips having the identified characteristics to create trimmed video clip segments.

Additional objects, advantages and novel features of the examples will be set forth in part in the description which follows, and in part will become apparent to those skilled in the art upon examination of the following and the accompanying drawings or may be learned by production or operation of the examples. The objects and advantages of the present subject matter may be realized and attained by means of the methodologies, instrumentalities and combinations particularly pointed out in the appended claims.

In the following detailed description, numerous specific details are set forth by way of examples in order to provide a thorough understanding of the relevant teachings. However, it should be apparent to those skilled in the art that the present teachings may be practiced without such details. In other instances, well known methods, procedures, components, and circuitry have been described at a relatively high level, without detail, in order to avoid unnecessarily obscuring aspects of the present teachings.

The term “coupled” as used herein refers to any logical, optical, physical or electrical connection, link or the like by which signals or light produced or supplied by one system element are imparted to another coupled element. Unless described otherwise, coupled elements or devices are not necessarily directly connected to one another and may be separated by intermediate components, elements or communication media that may modify, manipulate or carry the light or signals.

The orientations of the eyewear device, associated components and any complete devices incorporating an eye scanner and camera such as shown in any of the drawings, are given by way of example only, for illustration and discussion purposes. In operation for a particular variable optical processing application, the eyewear device may be oriented in any other direction suitable to the particular application of the eyewear device, for example up, down, sideways, or any other orientation. Also, to the extent used herein, any directional term, such as front, rear, inwards, outwards, towards, left, right, lateral, longitudinal, up, down, upper, lower, top, bottom and side, is used by way of example only, and is not limiting as to direction or orientation of any optic or component of an optic constructed as otherwise described herein.

Reference now is made in detail to the examples illustrated in the accompanying drawings and discussed below.

FIG. 1A is a side view of an example hardware configuration of an eyewear device 100, which includes a right optical assembly 180B with an image display 180D (FIG. 2A). Eyewear device 100 includes multiple visible light cameras 114A-B (FIG. 7) that form a stereo camera, of which the right visible light camera 114B is located on a right temple 110B.

The left and right visible light cameras 114A-B have an image sensor that is sensitive to the visible light range wavelength. Each of the visible light cameras 114A-B has a different frontward facing angle of coverage; for example, visible light camera 114B has the depicted angle of coverage 111B. The angle of coverage is the angle range over which the image sensor of the visible light camera 114A-B picks up electromagnetic radiation and generates images. Examples of such visible light cameras 114A-B include a high-resolution complementary metal-oxide-semiconductor (CMOS) image sensor and a video graphic array (VGA) camera, such as 640p (e.g., 640 × 480 pixels for a total of 0.3 megapixels), 720p, or 1080p. Image sensor data from the visible light cameras 114A-B are captured along with geolocation data, digitized by an image processor, and stored in a memory.

To provide stereoscopic vision, visible light cameras 114A-B may be coupled to an image processor (element 912 of FIG. 9) for digital processing along with a timestamp in which the image of the scene is captured. Image processor 912 includes circuitry to receive signals from the visible light cameras 114A-B and process those signals from the visible light cameras 114A-B into a format suitable for storage in the memory (element 934 of FIG. 9). The timestamp can be added by the image processor 912 or other processor, which controls operation of the visible light cameras 114A-B. Visible light cameras 114A-B allow the stereo camera to simulate human binocular vision. Stereo cameras provide the ability to reproduce three-dimensional images (element 715 of FIG. 7) based on two captured images (elements 758A-B of FIG. 7) from the visible light cameras 114A-B, respectively, having the same timestamp. Such three-dimensional images 715 allow for an immersive life-like experience, e.g., for virtual reality or video gaming. For stereoscopic vision, the pair of images 758A-B are generated at a given moment in time - one image for each of the left and right visible light cameras 114A-B. When the pair of generated images 758A-B from the frontward facing angles of coverage 111A-B of the left and right visible light cameras 114A-B are stitched together (e.g., by the image processor 912), depth perception is provided by the optical assembly 180A-B.
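
To illustrate the timestamp-based pairing described above, the following Python sketch matches a left-camera frame to the right-camera frame captured at (about) the same moment; the Frame fields, the tolerance value, and the function name are illustrative assumptions rather than details specified by this disclosure.

    from dataclasses import dataclass
    from typing import List, Optional

    @dataclass
    class Frame:
        timestamp_ms: int  # timestamp added by the image processor
        pixels: bytes      # raw image data from one visible light camera

    def find_stereo_pair(left: Frame, right_frames: List[Frame],
                         tolerance_ms: int = 1) -> Optional[Frame]:
        """Return the right-camera frame captured at (about) the same moment."""
        for right in right_frames:
            if abs(right.timestamp_ms - left.timestamp_ms) <= tolerance_ms:
                return right
        return None  # no matching frame; this left frame has no stereo pair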

In an example, a user interface field of view adjustment system includes the eyewear device 100. The eyewear device 100 includes a frame 105, a right temple 110B extending from a right lateral side 170B of the frame 105, and a see-through image display 180D (FIGS. 2A-B) comprising optical assembly 180B to present a graphical user interface to a user. The eyewear device 100 includes the left visible light camera 114A connected to the frame 105 or the left temple 110A to capture a first image of the scene. Eyewear device 100 further includes the right visible light camera 114B connected to the frame 105 or the right temple 110B to capture (e.g., simultaneously with the left visible light camera 114A) a second image of the scene which partially overlaps the first image. Although not shown in FIGS. 1A-B, the user interface field of view adjustment system further includes the processor 932 coupled to the eyewear device 100 and connected to the visible light cameras 114A-B, the memory 934 accessible to the processor 932, and programming in the memory 934, for example in the eyewear device 100 itself or another part of the user interface field of view adjustment system.

Although not shown in FIG. 1A, the eyewear device 100 also includes a head movement tracker (element 109 of FIG. 1B) or an eye movement tracker (element 213 of FIG. 2B). Eyewear device 100 further includes the see-through image displays 180C-D of optical assembly 180A-B for presenting a sequence of displayed images, and an image display driver (element 942 of FIG. 9) coupled to the see-through image displays 180C-D of optical assembly 180A-B to control the image displays 180C-D of optical assembly 180A-B to present the sequence of displayed images 715, which are described in further detail below. Eyewear device 100 further includes the memory 934 and the processor 932 having access to the image display driver 942 and the memory 934. Eyewear device 100 further includes programming (element 934 of FIG. 9) in the memory. Execution of the programming by the processor 932 configures the eyewear device 100 to perform functions, including functions to present, via the see-through image displays 180C-D, an initial displayed image of the sequence of displayed images, the initial displayed image having an initial field of view corresponding to an initial head direction or an initial eye gaze direction (element 230 of FIG. 5).

Execution of the programming by the processor 932 further configures the eyewear device 100 to detect movement of a user of the eyewear device by: (i) tracking, via the head movement tracker (element 109 of FIG. 1B), a head movement of a head of the user, or (ii) tracking, via an eye movement tracker (element 213 of FIG. 2B, FIG. 5), an eye movement of an eye of the user of the eyewear device 100. Execution of the programming by the processor 932 further configures the eyewear device 100 to determine a field of view adjustment to the initial field of view of the initial displayed image based on the detected movement of the user. The field of view adjustment includes a successive field of view corresponding to a successive head direction or a successive eye direction. Execution of the programming by the processor 932 further configures the eyewear device 100 to generate a successive displayed image of the sequence of displayed images based on the field of view adjustment. Execution of the programming by the processor 932 further configures the eyewear device 100 to present, via the see-through image displays 180C-D of the optical assembly 180A-B, the successive displayed images.

FIG. 1B is a top cross-sectional view of the temple of the eyewear device 100 of FIG. 1A depicting the right visible light camera 114B, a head movement tracker 109, and a circuit board. Construction and placement of the left visible light camera 114A is substantially similar to the right visible light camera 114B, except the connections and coupling are on the left lateral side 170A. As shown, the eyewear device 100 includes the right visible light camera 114B and a circuit board, which may be a flexible printed circuit board (PCB) 140B. The right hinge 226B connects the right temple 110B to a right temple 125B of the eyewear device 100. In some examples, components of the right visible light camera 114B, the flexible PCB 140B, or other electrical connectors or contacts may be located on the right temple 125B or the right hinge 226B.

As shown, eyewear device 100 has a head movement tracker 109, which includes, for example, an inertial measurement unit (IMU). An inertial measurement unit is an electronic device that measures and reports a body’s specific force, angular rate, and sometimes the magnetic field surrounding the body, using a combination of accelerometers and gyroscopes, sometimes also magnetometers. The inertial measurement unit works by detecting linear acceleration using one or more accelerometers and rotational rate using one or more gyroscopes. Typical configurations of inertial measurement units contain one accelerometer, gyroscope, and magnetometer per axis for each of the three axes: a horizontal axis for left-right movement (X), a vertical axis (Y) for top-bottom movement, and a depth or distance axis for up-down movement (Z). The accelerometer detects the gravity vector. The magnetometer defines the rotation in the magnetic field (e.g., facing south, north, etc.), like a compass which generates a heading reference. The three accelerometers detect acceleration along the horizontal, vertical, and depth axes defined above, which can be defined relative to the ground, the eyewear device 100, or the user wearing the eyewear device 100.

Eyewear device 100 detects movement of the user of the eyewear device 100 by tracking, via the head movement tracker 109, the head movement of the head of the user. The head movement includes a variation of head direction on a horizontal axis, a vertical axis, or a combination thereof from the initial head direction during presentation of the initial displayed image on the image display. In one example, tracking, via the head movement tracker 109, the head movement of the head of the user includes measuring, via the inertial measurement unit 109, the initial head direction on the horizontal axis (e.g., X axis), the vertical axis (e.g., Y axis), or the combination thereof (e.g., transverse or diagonal movement). Tracking, via the head movement tracker 109, the head movement of the head of the user further includes measuring, via the inertial measurement unit 109, a successive head direction on the horizontal axis, the vertical axis, or the combination thereof during presentation of the initial displayed image.

Tracking, via the head movement tracker 109, the head movement of the head of the user further includes determining the variation of head direction based on both the initial head direction and the successive head direction. Detecting movement of the user of the eyewear device 100 further includes, in response to tracking, via the head movement tracker 109, the head movement of the head of the user, determining that the variation of head direction exceeds a deviation angle threshold on the horizontal axis, the vertical axis, or the combination thereof. The deviation angle threshold is between about 3° and 10°. As used herein, the term “about” when referring to an angle means ± 10% from the stated amount.
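
As an illustration only, a minimal Python sketch of the threshold test described above follows; the way the IMU samples are obtained and the particular threshold value (3° here) are assumptions chosen for the example.

    def exceeds_deviation_threshold(initial_deg: float,
                                    successive_deg: float,
                                    threshold_deg: float = 3.0) -> bool:
        """True when the variation of head direction on one axis exceeds
        the deviation angle threshold (about 3 to 10 degrees)."""
        variation = abs(successive_deg - initial_deg)
        return variation > threshold_deg

    # Example: a 5 degree change on the horizontal axis exceeds a 3 degree
    # threshold, so a field of view adjustment would be triggered.
    assert exceeds_deviation_threshold(10.0, 15.0, threshold_deg=3.0)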

Variation along the horizontal axis slides three-dimensional objects, such as characters, bitmojis, application icons, etc., in and out of the field of view by, for example, hiding, unhiding, or otherwise adjusting visibility of the three-dimensional object. Variation along the vertical axis, for example when the user looks upwards, in one example displays weather information, time of day, date, calendar appointments, etc. In another example, when the user looks downwards on the vertical axis, the eyewear device 100 may power down.

The right temple 110B includes temple body 211 and a temple cap, with the temple cap omitted in the cross-section of FIG. 1B. Disposed inside the right temple 110B are various interconnected circuit boards, such as PCBs or flexible PCBs, that include controller circuits for the right visible light camera 114B, microphone(s) 130, speaker(s) 132, low-power wireless circuitry (e.g., for wireless short-range network communication via Bluetooth™), and high-speed wireless circuitry (e.g., for wireless local area network communication via WiFi).

The right visible light camera 114B is coupled to or disposed on the flexible PCB 140B and covered by a visible light camera cover lens, which is aimed through opening(s) formed in the right temple 110B. In some examples, the frame 105 connected to the right temple 110B includes the opening(s) for the visible light camera cover lens. The frame 105 includes a front-facing side configured to face outwards away from the eye of the user. The opening for the visible light camera cover lens is formed on and through the front-facing side. In the example, the right visible light camera 114B has an outwards facing angle of coverage 111B with a line of sight or perspective of the right eye of the user of the eyewear device 100. The visible light camera cover lens can also be adhered to an outwards facing surface of the right temple 110B in which an opening is formed with an outwards facing angle of coverage, but in a different outwards direction. The coupling can also be indirect via intervening components.

Left (first) visible light camera 114A is connected to the left see-through image display 180C of left optical assembly 180A to generate a first background scene of a first successive displayed image. The right (second) visible light camera 114B is connected to the right see-through image display 180D of right optical assembly 180B to generate a second background scene of a second successive displayed image. The first background scene and the second background scene partially overlap to present a three-dimensional observable area of the successive displayed image.

Flexible PCB 140B is disposed inside the right temple 110B and is coupled to one or more other components housed in the right temple 110B. Although shown as being formed on the circuit boards of the right temple 110B, the right visible light camera 114B can be formed on the circuit boards of the left temple 110A, the temples 125A-B, or the frame 105.

FIG. 2A is a rear view of an example hardware configuration of an eyewear device 100, which includes an eye scanner 213 (FIG. 2B) on a frame 105, for use in a system for determining an eye position and gaze direction of a wearer/user of the eyewear device 100. As shown in FIG. 2A, the eyewear device 100 is in a form configured for wearing by a user, which is eyeglasses in the example of FIG. 2A. The eyewear device 100 can take other forms and may incorporate other types of frameworks, for example, a headgear, a headset, or a helmet.

In the eyeglasses example, eyewear device 100 includes the frame 105 which includes the left rim 107A connected to the right rim 107B via the bridge 106 adapted for a nose of the user. The left and right rims 107A-B include respective apertures 175A-B which hold the respective optical element 180A-B, such as a lens and the see-through displays 180C-D. As used herein, the term lens is meant to cover transparent or translucent pieces of glass or plastic having curved and flat surfaces that cause light to converge/diverge or that cause little or no convergence/divergence.

Although shown as having two optical elements 180A-B, the eyewear device 100 can include other arrangements, such as a single optical element depending on the application or intended user of the eyewear device 100. As further shown, eyewear device 100 includes the left temple 110A adjacent the left lateral side 170A of the frame 105 and the right temple 110B adjacent the right lateral side 170B of the frame 105. The temples 110A-B may be integrated into the frame 105 on the respective sides 170A-B (as illustrated) or implemented as separate components attached to the frame 105 on the respective sides 170A-B. Alternatively, the temples 110A-B may be integrated into temples (not shown) attached to the frame 105.

Visible light cameras typically include a blue light filter to block infrared light detection. In an example, the infrared camera 220 (FIG. 2B) is a visible light camera, such as a low-resolution video graphic array (VGA) camera (e.g., 640 × 480 pixels for a total of 0.3 megapixels), with the blue filter removed. The infrared emitter 215 (FIG. 2B) and the infrared camera 220 are co-located on the frame 105; for example, both are shown as connected to the right temple 210B (FIG. 2B). The frame 105 or one or more of the left and right temples 110A-B include a circuit board (not shown) that includes the infrared emitter 215 and the infrared camera 220. The infrared emitter 215 and the infrared camera 220 can be connected to the circuit board by soldering, for example.

Other arrangements of the infrared emitter 215 and infrared camera 220 can be implemented, including arrangements in which the infrared emitter 215 and infrared camera 220 are both on the right rim 107B, or in different locations on the frame 105, for example, the infrared emitter 215 is on the left rim 107A and the infrared camera 220 is on the right rim 107B. In another example, the infrared emitter 215 is on the frame 105 and the infrared camera 220 is on one of the temples 110A-B, or vice versa. The infrared emitter 215 can be connected essentially anywhere on the frame 105, left temple 110A, or right temple 110B to emit a pattern of infrared light. Similarly, the infrared camera 220 can be connected essentially anywhere on the frame 105, left temple 110A, or right temple 110B to capture at least one reflection variation in the emitted pattern of infrared light.

The infrared emitter 215 and infrared camera 220 are arranged to face inwards towards an eye of the user with a partial or full field of view of the eye in order to identify the respective eye position and gaze direction. For example, the infrared emitter 215 and infrared camera 220 are positioned directly in front of the eye, in the upper part of the frame 105, or in the temples 110A-B at either end of the frame 105.

FIG. 2B is a rear view of an example hardware configuration of another eyewear device 200. In this example configuration, the eyewear device 200 is depicted as including an eye scanner 213 on a right temple 210B. As shown, an infrared emitter 215 and an infrared camera 220 are co-located on the right temple 210B. It should be understood that the eye scanner 213, or one or more components of the eye scanner 213, can be located on the left temple 210A or in other locations, and that the eye scanner 213 can be varied to be sensitive to different light wavelengths.

Similar to FIG. 2A, the eyewear device 200 includes a frame 105 which includes a left rim 107A which is connected to a right rim 107B via a bridge 106; and the left and right rims 107A-B include respective apertures which hold the respective optical elements 180A-B comprising the see-through display 180C-D.

FIGS. 2C-D are rear views of example hardware configurations of the eyewear device 100, including two different types of see-through image displays 180C-D. In one example, these see-through image displays 180C-D of optical assembly 180A-B include an integrated image display. As shown in FIG. 2C, the optical assemblies 180A-B include a suitable display matrix 180C-D of any suitable type, such as a liquid crystal display (LCD), an organic light-emitting diode (OLED) display, a waveguide display, or any other such display. The optical assembly 180A-B also includes an optical layer or layers 176, which can include lenses, optical coatings, prisms, mirrors, waveguides, optical strips, and other optical components in any combination. The optical layers 176A-N can include a prism having a suitable size and configuration and including a first surface for receiving light from the display matrix and a second surface for emitting light to the eye of the user. The prism of the optical layers 176A-N extends over all or at least a portion of the respective apertures 175A-B formed in the left and right rims 107A-B to permit the user to see the second surface of the prism when the eye of the user is viewing through the corresponding left and right rims 107A-B. The first surface of the prism of the optical layers 176A-N faces upwardly from the frame 105 and the display matrix overlies the prism so that photons and light emitted by the display matrix impinge the first surface. The prism is sized and shaped so that the light is refracted within the prism and is directed towards the eye of the user by the second surface of the prism of the optical layers 176A-N. In this regard, the second surface of the prism of the optical layers 176A-N can be convex to direct the light towards the center of the eye. The prism can optionally be sized and shaped to magnify the image projected by the see-through image displays 180C-D, and the light travels through the prism so that the image viewed from the second surface is larger in one or more dimensions than the image emitted from the see-through image displays 180C-D.

In another example, the see-through image displays 180C-D of optical assembly 180A-B include a projection image display as shown in FIG. 2D. The optical assembly 180A-B includes a laser projector 150, which is a three-color laser projector using a scanning mirror or galvanometer. During operation, an optical source such as a laser projector 150 is disposed in or on one of the temples 125A-B of the eyewear device 100. Optical assembly 180A-B includes one or more optical strips 155A-N spaced apart across the width of the lens of the optical assembly 180A-B or across a depth of the lens between the front surface and the rear surface of the lens.

As the photons projected by the laser projector 150 travel across the lens of the optical assembly 180A-B, the photons encounter the optical strips 155A-N. When a particular photon encounters a particular optical strip, the photon is either redirected towards the user’s eye, or it passes to the next optical strip. A combination of modulation of laser projector 150, and modulation of optical strips, may control specific photons or beams of light. In an example, a processor controls optical strips 155A-N by initiating mechanical, acoustic, or electromagnetic signals. Although shown as having two optical assemblies 180A-B, the eyewear device 100 can include other arrangements, such as a single or three optical assemblies, or the optical assembly 180A-B may have a different arrangement depending on the application or intended user of the eyewear device 100.

As further shown in FIGS. 2C-D, eyewear device 100 includes a left temple 110A adjacent the left lateral side 170A of the frame 105 and a right temple 110B adjacent the right lateral side 170B of the frame 105. The temples 110A-B may be integrated into the frame 105 on the respective lateral sides 170A-B (as illustrated) or implemented as separate components attached to the frame 105 on the respective sides 170A-B. Alternatively, the temples 110A-B may be integrated into temples 125A-B attached to the frame 105.

In one example, the see-through image displays include the first see-through image display 180C and the second see-through image display 180D. Eyewear device 100 includes first and second apertures 175A-B which hold the respective first and second optical assembly 180A-B. The first optical assembly 180A includes the first see-through image display 180C (e.g., a display matrix of FIG. 2C or optical strips 155A-N′ and a projector 150A). The second optical assembly 180B includes the second see-through image display 180D (e.g., a display matrix of FIG. 2C or optical strips 155A-N″ and a projector 150B). The successive field of view of the successive displayed image includes an angle of view between about 15° and 30°, and more specifically 24°, measured horizontally, vertically, or diagonally. The successive displayed image having the successive field of view represents a combined three-dimensional observable area visible through stitching together of two displayed images presented on the first and second image displays.

As used herein, “an angle of view” describes the angular extent of the field of view associated with the displayed images presented on each of the left and right image displays 180C-D of optical assembly 180A-B. The “angle of coverage” describes the angle range that a lens of visible light cameras 114A-B or infrared camera 220 can image. Typically, the image circle produced by a lens is large enough to cover the film or sensor completely, possibly including some vignetting (i.e., a reduction of an image’s brightness or saturation toward the periphery compared to the image center). If the angle of coverage of the lens does not fill the sensor, the image circle will be visible, typically with strong vignetting toward the edge, and the effective angle of view will be limited to the angle of coverage. The “field of view” is intended to describe the field of observable area which the user of the eyewear device 100 can see through his or her eyes via the displayed images presented on the left and right image displays 180C-D of the optical assembly 180A-B. Image display 180C of optical assembly 180A-B can have a field of view with an angle of coverage between 15° and 30°, for example 24°, and have a resolution of 480 × 480 pixels.

FIG. 3 shows a rear perspective view of the eyewear device of FIG. 2A, but including the infrared camera 220, a frame front 330, a frame back 335, and a circuit board 340. It can be seen in FIG. 3 that the upper portion of the left rim of the frame of the eyewear device 100 includes the frame front 330 and the frame back 335. An opening for the infrared emitter 215 is formed on the frame back 335.

As shown in the encircled cross-section 4-4 in the upper middle portion of the left rim of the frame, a circuit board, which is a flexible PCB 340, is sandwiched between the frame front 330 and the frame back 335. Also shown in further detail is the attachment of the left temple 110A to the left temple 325A via the left hinge 326A. In some examples, components of the eye movement tracker 213, including the infrared emitter 215, the flexible PCB 340, or other electrical connectors or contacts may be located on the left temple 325A or the left hinge 326A.

FIG. 4 is a cross-sectional view through the infrared emitter 215 and the frame corresponding to the encircled cross-section 4-4 of the eyewear device of FIG. 3. Multiple layers of the eyewear device 100 are illustrated in the cross-section of FIG. 4; as shown, the frame includes the frame front 330 and the frame back 335. The flexible PCB 340 is disposed on the frame front 330 and connected to the frame back 335. The infrared emitter 215 is disposed on the flexible PCB 340 and covered by an infrared emitter cover lens 445. For example, the infrared emitter 215 is reflowed to the back of the flexible PCB 340. Reflowing attaches the infrared emitter 215 to contact pad(s) formed on the back of the flexible PCB 340 by subjecting the flexible PCB 340 to controlled heat which melts a solder paste to connect the two components. In one example, reflowing is used to surface mount the infrared emitter 215 on the flexible PCB 340 and electrically connect the two components. However, it should be understood that through-holes can be used to connect leads from the infrared emitter 215 to the flexible PCB 340 via interconnects, for example.

The frame back 335 includes an infrared emitter opening 450 for the infrared emitter cover lens 445. The infrared emitter opening 450 is formed on a rear-facing side of the frame back 335 that is configured to face inwards towards the eye of the user. In the example, the flexible PCB 340 can be connected to the frame front 330 via the flexible PCB adhesive 460. The infrared emitter cover lens 445 can be connected to the frame back 335 via infrared emitter cover lens adhesive 455. The coupling can also be indirect via intervening components.

In an example, the processor 932 utilizes eye tracker 213 to determine an eye gaze direction 230 of a wearer’s eye 234 as shown in FIG. 5, and an eye position 236 of the wearer’s eye 234 within an eyebox as shown in FIG. 6. The eye tracker 213 is a scanner which uses infrared light illumination (e.g., near-infrared, short-wavelength infrared, mid-wavelength infrared, long-wavelength infrared, or far infrared) to capture an image of reflection variations of infrared light from the eye 234 to determine the gaze direction 230 of a pupil 232 of the eye 234, and also the eye position 236 with respect to the see-through display 180D.

FIG. 7 depicts an example of capturing visible light with cameras. Visible light is captured by the left visible light camera 114A with a left visible light camera field of view 111A as a left raw image 758A. Visible light is captured by the right visible light camera 114B with a right visible light camera field of view 111B as a right raw image 758B. The processor 932 (FIG. 9) processes the left raw image 758A and the right raw image 758B to create a three-dimensional depth map 715 of a three-dimensional scene, referred to herein as an image frame. Depending on a camera capture mode, the processor 932 creates a still image frame, or creates a series of image frames that make up a video clip 800, as shown in FIG. 8A.
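
The disclosure does not name a particular algorithm for computing the depth map 715. As one conventional possibility, the Python sketch below uses OpenCV block-matching stereo to derive a disparity (depth) map from a left/right raw image pair; the file names and parameter values are placeholders, not part of the described system.

    import cv2

    # Left and right raw images captured with the same timestamp (placeholder paths).
    left_raw = cv2.imread("left_raw_758A.png", cv2.IMREAD_GRAYSCALE)
    right_raw = cv2.imread("right_raw_758B.png", cv2.IMREAD_GRAYSCALE)

    # Block-matching stereo: numDisparities must be a multiple of 16.
    stereo = cv2.StereoBM_create(numDisparities=64, blockSize=15)
    disparity = stereo.compute(left_raw, right_raw)  # larger disparity = closer object

    # The disparity map can then serve as the three-dimensional depth map
    # associated with one image frame of a video clip.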

FIG. 8A illustrates an example of a plurality of video clips 800 created by a user of eyewear 100/200. Each video clip 800 is created using the visible light cameras 114A-114B when the user uses eyewear 100/200 to capture images of the world about the user. Each video clip 800 is considered a raw video clip, meaning the video clip 800 is created from when the user enables the cameras 114A-B until the user disables the cameras, in an unedited form. These video clips 800 are automatically uploaded by processor 932 to the server system 998 over network 995 upon creation.

Referring to FIG. 8B, according to this disclosure, the raw video clips 800 are analyzed by a server processor 952 of the server system 998 and automatically trimmed to create trimmed video clip segments 802 using a video trimming algorithm 945 comprising computer executable instructions stored in server memory 954 (FIG. 9). As will be described in more detail with reference to FIG. 11, the server processor 952 using the video trimming algorithm 945 gathers per-frame information about each of the video clips 800 using computer vision (CV) techniques that are commercially available and stored in server memory 954. The per-frame information includes several characteristics, such as whether the camera was obstructed or overexposed, the stability of segments in the video clip, whether segments contain pets or humans, human emotions, and human speech.
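
A minimal sketch of what the gathered per-frame information might look like is shown below; the FrameInfo fields and the detector callables are hypothetical stand-ins for the commercially available CV techniques mentioned above, not an API defined by this disclosure.

    from dataclasses import dataclass
    from typing import Callable, Dict, List, Optional

    @dataclass
    class FrameInfo:
        frame_index: int
        obstructed: bool        # camera blocked or overexposed
        stable: bool            # little or no camera motion
        has_pet: bool
        has_human: bool
        emotion: Optional[str]  # e.g., "smile", or None if no emotion detected
        has_speech: bool

    def gather_per_frame_info(frames: List, detectors: Dict[str, Callable]) -> List[FrameInfo]:
        """detectors maps a characteristic name to a callable(frame);
        in practice each callable would wrap an off-the-shelf CV model."""
        infos = []
        for i, frame in enumerate(frames):
            infos.append(FrameInfo(
                frame_index=i,
                obstructed=detectors["obstructed"](frame),
                stable=detectors["stable"](frame),
                has_pet=detectors["pet"](frame),
                has_human=detectors["human"](frame),
                emotion=detectors["emotion"](frame),
                has_speech=detectors["speech"](frame),
            ))
        return infos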

Based on a set of rules stored in server memory 954 and the gathered per-frame information, the server processor 952 automatically determines trim points and creates the trimmed video clip segments 802, referred to herein as auto editing. The segments of the video clip 800 that do not meet the set of rules are omitted by server processor 952 from the trimmed video clip segments 802. For example, the set of rules may be to include video segments that include a human and/or pet, and/or video segments that have a specified object. The portions of a video clip 800 that do not include a human and/or pet are trimmed out of the raw video clip 800 to create the trimmed video clip segments 802.
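
A minimal sketch of this rule-based auto editing, under the assumption that per-frame information is available in the FrameInfo form sketched above, keeps only contiguous runs of frames that satisfy the rule (here, contains a human or a pet) and reports them as trim points:

    def auto_trim(frame_infos, rule=lambda f: f.has_human or f.has_pet):
        """Return (start_frame, end_frame) trim points for segments that meet the rule."""
        segments = []
        start = None
        for info in frame_infos:
            if rule(info) and start is None:
                start = info.frame_index                     # segment begins
            elif not rule(info) and start is not None:
                segments.append((start, info.frame_index))   # segment ends
                start = None
        if start is not None:
            segments.append((start, frame_infos[-1].frame_index + 1))
        return segments  # frames outside these ranges are trimmed out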

Referring to FIG. 8C, the server processor 952 then creates a summary from the trimmed video clip segments 802 using a video highlights algorithm 950 (FIG. 9), as will be described in more detail with reference to FIG. 10. The summary is a subset of the trimmed video clip segments 802, i.e., a set that includes fewer than all of the trimmed video clip segments 802. The summary of trimmed video clip segments 802 is referred to as video highlights 804. The video highlights algorithm 950 determines which of the trimmed video clip segments 802 are suitable to generate the video highlights 804, based on the number of trimmed video clip segments 802, their combined length in time, and other user determined variables. If a previous video highlights 804 already exists on the server system 998 that has not been permanently saved by a user of mobile device 990, it is deleted on the server system 998.
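
One way such a selection could be sketched is shown below; the segment attributes (score, duration, start_time), the count cap, and the duration cap are assumptions standing in for the user determined variables mentioned above.

    def select_highlights(segments, max_count=5, max_total_seconds=300):
        """Pick a subset of trimmed segments to form the video highlights."""
        chosen, total = [], 0.0
        # Consider the best-scoring segments first (score assigned upstream).
        for seg in sorted(segments, key=lambda s: s.score, reverse=True):
            if len(chosen) >= max_count or total + seg.duration > max_total_seconds:
                continue
            chosen.append(seg)
            total += seg.duration
        # Play the chosen segments back in chronological order.
        return sorted(chosen, key=lambda s: s.start_time)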

The video highlights 804 is downloaded from the server system 998 to the mobile device 990 of the user for playback at a regular period, such as once a day in the evening. In one example, the combined length in time of the trimmed video clip segments 802 processed since the last generated video highlights 804 needs to meet a predetermined minimum time, such as 2 minutes, to generate a new video highlights 804. No limitation as to the time of day, frequency, or length is to be inferred from this disclosure; these variables are provided as examples.

FIG. 9 depicts a high-level functional block diagram including example electronic components disposed in eyewear 100 and 200. The illustrated electronic components include the processor 932 and memory 934, which includes the instructions to implement functionality of eyewear 100/200. Processor 932 receives power from a battery (not shown) and executes the instructions stored in memory 934, or integrated with the processor 932 on-chip, to perform the functionality of eyewear 100/200 and to communicate with external devices via wireless connections.

A user interface adjustment system 900 includes a wearable device, which is the eyewear device 100 with an eye movement tracker 213 (e.g., shown as infrared emitter 215 and infrared camera 220 in FIG. 2B). User interface adjustment system 900 also includes a mobile device 990 and a server system 998 connected via various networks. Mobile device 990 may be a smartphone, tablet, laptop computer, access point, or any other such device capable of connecting with eyewear device 100 using both a low-power wireless connection 925 and a high-speed wireless connection 937. Mobile device 990 is connected to server system 998 and network 995. The network 995 may include any combination of wired and wireless connections.

Eyewear device 100 includes at least two visible light cameras 114A-B (one associated with the left lateral side 170A and one associated with the right lateral side 170B). Eyewear device 100 further includes two see-through image displays 180C-D of the optical assembly 180A-B (one associated with the left lateral side 170A and one associated with the right lateral side 170B). The image displays 180C-D are optional in this disclosure. Eyewear device 100 also includes image display driver 942, image processor 912, low-power circuitry 920, and high-speed circuitry 930. The components shown in FIG. 9 for the eyewear device 100 are located on one or more circuit boards, for example a PCB or flexible PCB, in the temples. Alternatively, or additionally, the depicted components can be located in the temples, frames, hinges, or bridge of the eyewear device 100. Left and right visible light cameras 114A-B can include digital camera elements such as a complementary metal-oxide-semiconductor (CMOS) image sensor, charge coupled device, a lens, or any other respective visible or light capturing elements that may be used to capture data, including images of scenes with unknown objects.

Eye movement tracking programming 945 implements the user interface field of view adjustment instructions, including instructions to cause the eyewear device 100 to track, via the eye movement tracker 213, the eye movement of the eye of the user of the eyewear device 100. Other implemented instructions (functions) cause the eyewear device 100 to determine a field of view adjustment to the initial field of view of an initial displayed image based on the detected eye movement of the user corresponding to a successive eye direction. Further implemented instructions generate a successive displayed image of the sequence of displayed images based on the field of view adjustment. The successive displayed image is produced as visible output to the user via the user interface. This visible output appears on the see-through image displays 180C-D of optical assembly 180A-B, which are driven by image display driver 942 to present the sequence of displayed images, including the initial displayed image with the initial field of view and the successive displayed image with the successive field of view.

As shown in FIG. 9, high-speed circuitry 930 includes high-speed processor 932, memory 934, and high-speed wireless circuitry 936. In the example, the image display driver 942 is coupled to the high-speed circuitry 930 and operated by the high-speed processor 932 in order to drive the left and right image displays 180C-D of the optical assembly 180A-B. High-speed processor 932 may be any processor capable of managing high-speed communications and operation of any general computing system needed for eyewear device 100. High-speed processor 932 includes processing resources needed for managing high-speed data transfers on high-speed wireless connection 937 to a wireless local area network (WLAN) using high-speed wireless circuitry 936. In certain examples, the high-speed processor 932 executes an operating system such as a LINUX operating system or other such operating system of the eyewear device 100 and the operating system is stored in memory 934 for execution. In addition to any other responsibilities, the high-speed processor 932 executing a software architecture for the eyewear device 100 is used to manage data transfers with high-speed wireless circuitry 936. In certain examples, high-speed wireless circuitry 936 is configured to implement Institute of Electrical and Electronic Engineers (IEEE) 802.11 communication standards, also referred to herein as Wi-Fi. In other examples, other high-speed communications standards may be implemented by high-speed wireless circuitry 936.

Low-power wireless circuitry 924 and the high-speed wireless circuitry 936 of the eyewear device 100 can include short range transceivers (Bluetooth™) and wireless wide, local, or wide area network transceivers (e.g., cellular or WiFi). Mobile device 990, including the transceivers communicating via the low-power wireless connection 925 and high-speed wireless connection 937, may be implemented using details of the architecture of the eyewear device 100, as can other elements of network 995.

Memory 934 includes any storage device capable of storing various data and applications, including, among other things, color maps, camera data generated by the left and right visible light cameras 114A-B and the image processor 912, as well as images generated for display by the image display driver 942 on the see-through image displays 180C-D of the optical assembly 180A-B. While memory 934 is shown as integrated with high-speed circuitry 930, in other examples, memory 934 may be an independent standalone element of the eyewear device 100. In certain such examples, electrical routing lines may provide a connection through a chip that includes the high-speed processor 932 from the image processor 912 or low-power processor 922 to the memory 934. In other examples, the high-speed processor 932 may manage addressing of memory 934 such that the low-power processor 922 will boot the high-speed processor 932 any time that a read or write operation involving memory 934 is needed.

Server system 998 may be one or more computing devices as part of a service or network computing system, for example, that includes a processor 952, a memory 954, and a network communication interface to communicate over the network 995 with the mobile device 990 and eyewear devices 100 and 200. Eyewear devices 100 and 200 are connected with a host computer. For example, the eyewear device 100 or 200 is paired with the mobile device 990 via the high-speed wireless connection 937 or connected to the server system 998 via the network 995.

Output components of the eyewear device 100 include visual components, such as the left and right image displays 180C-D of optical assembly 180A-B as described in FIGS. 2C-D (e.g., a display such as a liquid crystal display (LCD), a plasma display panel (PDP), a light emitting diode (LED) display, a projector, or a waveguide). The image displays 180C-D of the optical assembly 180A-B are driven by the image display driver 942. The output components of the eyewear device 100 further include acoustic components (e.g., speakers), haptic components (e.g., a vibratory motor), other signal generators, and so forth. The input components of the eyewear device 100, the mobile device 990, and server system 998 may include alphanumeric input components (e.g., a keyboard, a touch screen configured to receive alphanumeric input, a photo-optical keyboard, or other alphanumeric input components), point-based input components (e.g., a mouse, a touchpad, a trackball, a joystick, a motion sensor, or other pointing instruments), tactile input components (e.g., a physical button, a touch screen that provides location and force of touches or touch gestures, or other tactile input components), audio input components (e.g., a microphone), and the like.

Eyewear device 100 may optionally include additional peripheral device elements. Such peripheral device elements may include biometric sensors, additional sensors, or display elements integrated with eyewear device 100. For example, peripheral device elements may include any I/O components including output components, motion components, position components, or any other such elements described herein.

For example, the biometric components of the user interface field of view adjustment system 900 include components to detect expressions (e.g., hand expressions, facial expressions, vocal expressions, body gestures, or eye tracking), measure biosignals (e.g., blood pressure, heart rate, body temperature, perspiration, or brain waves), identify a person (e.g., voice identification, retinal identification, facial identification, fingerprint identification, or electroencephalogram based identification), and the like. The motion components include acceleration sensor components (e.g., accelerometer), gravitation sensor components, rotation sensor components (e.g., gyroscope), and so forth. The position components include location sensor components to generate location coordinates (e.g., a Global Positioning System (GPS) receiver component), WiFi or Bluetooth™ transceivers to generate positioning system coordinates, altitude sensor components (e.g., altimeters or barometers that detect air pressure from which altitude may be derived), orientation sensor components (e.g., magnetometers), and the like. Such positioning system coordinates can also be received over wireless connections 925 and 937 from the mobile device 990 via the low-power wireless circuitry 924 or high-speed wireless circuitry 936.

According to some examples, an “application” or “applications” are program(s) that execute functions defined in the programs. Various programming languages can be employed to create one or more of the applications, structured in a variety of manners, such as object-oriented programming languages (e.g., Objective-C, Java, or C++) or procedural programming languages (e.g., C or assembly language). In a specific example, a third party application (e.g., an application developed using the ANDROID™ or IOS™ software development kit (SDK) by an entity other than the vendor of the particular platform) may be mobile software running on a mobile operating system such as IOS™, ANDROID™, WINDOWS® Phone, or another mobile operating system. In this example, the third-party application can invoke API calls provided by the operating system to facilitate functionality described herein.

FIG. 10 is a flowchart of the video highlights algorithm 950 illustrating the operation of the server system 998 upon receiving uploaded video clips 800 from eyewear device 100/200, performed by the server processor 952 executing instructions stored in server memory 954, to create the video highlights 804. Although shown as occurring serially, the blocks of FIG. 10 may be reordered or parallelized depending on the implementation. If a previous video highlights 804 already exists on the server system 998 that has not been permanently saved by a user of mobile device 990, it is deleted on the server system 998.

Blocks 1002-1012 are performed by processor 952 implementing the video highlights algorithm 950.

At block 1002, the processor 952 collects video clips 800 over network 995 from the eyewear 100/200 using the cameras 114A-B as previously described with reference to FIG. 7 and FIG. 8A. The eyewear 100/200 automatically uploads all captured video clips 800 to the server system 998 upon creation. The plurality of video clips 800 are stored in memory 954 and are a set of raw unedited video clips as discussed and shown in FIG. 8A. The set of video clips 800 are those created and stored since the last video highlights was created. In one example, a video highlights 804 is created periodically, such as every 24 hours at a set time, for example 8 pm each day.

At block 1004, the server processor 952 using the video trimming algorithm 945 gathers per-frame information about each of the video clips 800 using computer vision (CV) techniques that are commercially available and stored in server memory 954. The per-frame information includes several characteristics, such as whether the camera was obstructed or overexposed, the stability of segments in the video clip, whether segments contain pets or humans, human emotions, and human speech.

At block 1006, based on a set of rules stored in server memory 954 and the gathered per-frame information, the server processor 952 automatically determines trim points and creates the trimmed video clip segments 802 (FIG. 8B), referred to herein as auto editing. The segments of the video clip 800 that do not meet the set of rules are omitted by server processor 952 from the trimmed video clip segments 802. For example, the set of rules may be to include video segments that include a human and/or pet, and/or video segments that have a specified object. The portions of a video clip 800 that do not include a human and/or pet are trimmed out of the raw video clip 800 to create the trimmed video clip segments 802.

At block 1008, the processor 952 decides if the trimmed video clip segments 802 are suitable to generate the video highlights 804, such as based on the number of the video clip segments 802, such as five, and their combined length in time. If the combined length of time is greater than a predetermined time limit, such as 5 or 10 minutes for example, the processor 952 compares the trimmed video clip segments 802 to remove similar segments. If the combined length of time is still greater than the predetermined limit, the processor will continue to remove trimmed video clip segments 802 based on a similarity score created by the processor 952. For instance, the processor 952 can compare two trimmed video clips 802 having a high degree of similarity and retain the trimmed video clip having a higher quality and/or a shorter duration, remove frames within video clips without affecting context, and iterate this process until the desired length is achieved. Not all of the trimmed video clip segments 802 may be included in the video highlights 804.
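
A hedged sketch of this pruning loop follows; the similarity function, the quality attribute, and the numeric limits are illustrative assumptions rather than the disclosed implementation.

    def prune_similar(segments, similarity, limit_seconds=300, threshold=0.8):
        """Remove near-duplicate segments until the combined length fits the limit."""
        segments = list(segments)
        while sum(s.duration for s in segments) > limit_seconds and len(segments) > 1:
            best_pair, best_score = None, 0.0
            for i in range(len(segments)):
                for j in range(i + 1, len(segments)):
                    score = similarity(segments[i], segments[j])
                    if score > best_score:
                        best_pair, best_score = (i, j), score
            if best_pair is None or best_score < threshold:
                break  # nothing similar enough remains to remove
            i, j = best_pair
            # Keep the higher-quality clip of the pair; drop the other.
            drop = i if segments[i].quality < segments[j].quality else j
            segments.pop(drop)
        return segments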

At block 1010, the processor 952 generates the video highlights 804 based on the remaining trimmed video clip segments from block 1006 as shown in FIG. 8C. The video highlights 804 is stored in server memory 954.
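
As one illustrative way to assemble the remaining trimmed segments into a single highlights video (the disclosure does not name a specific tool), the moviepy library could be used; the file paths below are placeholders.

    from moviepy.editor import VideoFileClip, concatenate_videoclips

    segment_files = ["segment_01.mp4", "segment_02.mp4", "segment_03.mp4"]
    clips = [VideoFileClip(path) for path in segment_files]

    highlights = concatenate_videoclips(clips)          # join segments end to end
    highlights.write_videofile("video_highlights.mp4")  # the stored video highlights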

At block 1012, the processor 952 generates a notification indicating that the video highlights 804 is ready. The notification is sent to a user’s mobile device 990 via the network 995 as shown in FIG. 9. The user can then download and view the video highlights 804, as will be discussed with respect to FIG. 12 shortly.

Referring now to FIG. 11, there is shown the video trimming algorithm 945 performed by the processor 952 on each of the video clips 800 using CV techniques as discussed with respect to block 1004 of FIG. 10. This video trimming algorithm 945 processes each of the video clips 800 to ascertain characteristics of the video clips 800, such as to identify frames and segments of the video clips 800 that have certain characteristics. This information is used by the processor 952 to automatically create the trimmed video clip segments 802, and the video highlights 804.

At block 1102, the processor 952 performs bad scenario detection on each of the video clips 800. This detection determines whether camera 114A and/or camera 114B was partially or totally obstructed or overexposed during capture of frames, and flags those video clips 800 to be removed from the video highlights 804.

At block 1104, video stability detection is performed by processor 952. Processor 952 searches for stable (slow or no camera movement) segments of the video clips 800.

At block 1106, the processor 952 performs human/pet detection. This detection finds segments of video clips 800 containing a human or pet. This detection can also be done for any other interesting objects, such as cars, planes, buildings, etc.

At block 1108, the processor 952 performs human/pet tracking. This detection determines if an object detected in each video clip segment is the center of attention of the video clip 800.

At block 1110, the processor 952 performs emotion detection. The processor 952 finds segments of the video clip 800 that contain a human emotion, such as a smile.

At block 1112, the processor 952 performs speech detection. The processor 952 searches for segments of the video clips 800 with human speech. This helps avoid trimming a video clip 800 in the middle of a sentence or word.
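
A small sketch of how speech flags could keep a cut from landing mid-sentence is shown below; the frame-level has_speech flags follow the hypothetical per-frame structure sketched earlier and are not part of the disclosed algorithm itself.

    def adjust_trim_point(cut_frame: int, has_speech: list) -> int:
        """Move a proposed cut forward until it lands on a non-speech frame."""
        i = cut_frame
        while i < len(has_speech) and has_speech[i]:
            i += 1
        return i

    # Example: speech runs through frames 10-14, so a cut proposed at frame 12
    # is deferred to frame 15.
    speech_flags = [False] * 10 + [True] * 5 + [False] * 5
    assert adjust_trim_point(12, speech_flags) == 15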

At block 1114, the processor 952 creates the trimmed video clip segments 802 based on the set of rules, such as segments that satisfy the detections of blocks 1102-1112. The trimmed video clip segments 802 that meet the set of rules are the segments from which the video highlights 804 are generated.

FIG. 12 is a high-level functional block diagram of an example mobile device 990 that communicates via network 995 with server system 998 of FIG. 9. Shown are elements of a touch screen type mobile device 990 having a video highlights editing algorithm 1200, although other non-touch type mobile devices may alternatively be used. Examples of touch screen type mobile devices that may be used include (but are not limited to) a smart phone, a personal digital assistant (PDA), a tablet computer, a laptop computer, or other portable device. However, the structure and operation of the touch screen type devices is provided by way of example, and the subject technology as described herein is not intended to be limited thereto. For purposes of this discussion, FIG. 12 therefore provides a block diagram illustration of the example mobile device 990 having a touch screen display 1290 driven by a display driver 1090 for displaying content and receiving user input as (or as part of) the user interface. Mobile device 990 also includes camera(s) 1270, such as visible light camera(s).

As shown in FIG. 12, the mobile device 990 includes at least one digital transceiver (XCVR) 1210, shown as WWAN XCVRs, for digital wireless communications via a wide area wireless mobile communication network 995. The mobile device 990 also includes additional digital or analog transceivers, such as short range XCVRs 1220 for short-range network communication, such as via NFC, VLC, DECT, ZigBee, Bluetooth™, or WiFi. For example, short range XCVRs 1220 may take the form of any available two-way wireless local area network (WLAN) transceiver of a type that is compatible with one or more standard protocols of communication implemented in wireless local area networks, such as one of the Wi-Fi standards under IEEE 802.11 and 4G LTE.

To generate location coordinates for positioning of the mobile device 990, the mobile device 990 can include a global positioning system (GPS) receiver. Alternatively, or additionally, the mobile device 990 can utilize either or both the short range XCVRs 1220 and WWAN XCVRs 1210 for generating location coordinates for positioning. For example, cellular network, WiFi, or Bluetooth™ based positioning systems can generate very accurate location coordinates, particularly when used in combination. Such location coordinates can be transmitted to the eyewear device 100/200 over one or more network connections via XCVRs 1220.

The transceivers 1210, 1220 (network communication interface) conform to one or more of the various digital wireless communication standards utilized by modern mobile networks. Examples of WWAN transceivers 1210 include (but are not limited to) transceivers configured to operate in accordance with Code Division Multiple Access (CDMA) and 3rd Generation Partnership Project (3GPP) network technologies including, for example and without limitation, 3GPP type 2 (or 3GPP2) and LTE, at times referred to as “4G.” For example, the transceivers 1210, 1220 provide two-way wireless communication of information including digitized audio signals, still image and video signals, web page information for display as well as web related inputs, and various types of mobile message communications to/from the mobile device 990 for user identification strategies.

Several of these types of communications through the transceivers 1210, 1220 and network 995, as discussed previously, relate to protocols and procedures in support of communications with the server system 998. Such communications, for example, may transport packet data via the short range XCVRs 1220 over the wireless connections of network 995 to and from the server system 998 as shown in FIG. 9. Such communications, for example, may also transport data utilizing IP packet data transport via the WWAN XCVRs 1210 over the network (e.g., Internet) 995 shown in FIG. 9. Both WWAN XCVRs 1210 and short range XCVRs 1220 connect through radio frequency (RF) send-and-receive amplifiers (not shown) to an associated antenna (not shown).

The mobile device 990 further includes a microprocessor 1230, shown as a CPU, sometimes referred to herein as the host controller. A processor is a circuit having elements structured and arranged to perform one or more processing functions, typically various data processing functions. Although discrete logic components could be used, the examples utilize components forming a programmable CPU. A microprocessor for example includes one or more integrated circuit (IC) chips incorporating the electronic elements to perform the functions of the CPU. The processor 1230, for example, may be based on any known or available microprocessor architecture, such as Reduced Instruction Set Computing (RISC) using an ARM architecture, as commonly used today in mobile devices and other portable electronic devices. Of course, other processor circuitry may be used to form the CPU 1230 or processor hardware in a smartphone, laptop computer, or tablet.

The microprocessor 1230 serves as a programmable host controller for the mobile device 990 by configuring the mobile device to perform various operations, for example, in accordance with instructions or programming executable by processor 1230. For example, such operations may include various general operations of the mobile device, as well as operations related to performance metric monitoring, reporting to server system 998, and gating. Although a processor may be configured by use of hardwired logic, typical processors in mobile devices are general processing circuits configured by execution of programming.

The mobile device 990 includes a memory or storage device system for storing data and programming. In the example, the memory system may include a flash memory 1240 and a random access memory (RAM) 1242. The RAM 1242 serves as short term storage for instructions and data being handled by the processor 1230, e.g., as a working data processing memory. The flash memory 1240 typically provides longer term storage.

Hence, in the example of mobile device 990, the flash memory 1240 is used to store programming or instructions for execution by the processor 1230. Depending on the type of device, the mobile device 990 stores and runs a mobile operating system through which specific applications are executed. Applications, such as video highlights editing 1200, may be a native application, a hybrid application, or a web application (e.g., a dynamic web page executed by a web browser) that runs on mobile device 990 to uniquely identify the user. Examples of mobile operating systems include Google Android®, Apple iOS® (iPhone or iPad devices), Windows Mobile®, Amazon Fire OS®, RIM BlackBerry® operating system, or the like.

Referring to FIG. 13, there is shown a device algorithm 1300 performed by processor 1230 of mobile device 990.

At block 1302, when the mobile device 990 receives a notification as discussed with respect to block 1010 of FIG. 10 from the server system 998 that a new video highlights 804 is available, at block 1304 the mobile device 990 automatically downloads the trimmed video clip segments 802 of video highlights 804 over the network 995 and stores them in memory, such as RAM 1242, of mobile device 990.

At block 1306, after completing the download of the trimmed video clip segments 802 of video highlights 804, the processor 1230 responsively displays a thumbnail image 1260 indicative of the downloaded trimmed video clip segments 802 comprising video highlights 804 on touch screen display 1290 with the title (e.g., the date) of the video highlights 804. These trimmed video clip segments 802 of video highlights 804 are stored in memory, such as RAM 1242.

At block 1308, when the user taps on the thumbnail 1260, the trimmed video clip segments 802 responsively begin to play back in order on display 1290. The trimmed video clip segments 802 will play in order until the last one plays. The user can pause the playback by touching the display 1290, and can resume playback by touching the display again.
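
A tiny illustrative sketch of the playback behavior described for block 1308 (ordered playback with tap-to-pause/resume) follows; the class and method names are assumptions for illustration only, not an implementation required by the disclosure.

    class HighlightPlayback:
        """Tiny state sketch: ordered playback with tap-to-pause/resume."""

        def __init__(self, segments):
            self.segments = list(segments)  # trimmed video clip segments, in order
            self.index = 0
            self.paused = False

        def on_tap(self):
            # Per the described behavior, a tap toggles pause and resume.
            self.paused = not self.paused

        def next_segment(self):
            """Return the next segment to play, or None when paused or finished."""
            if self.paused or self.index >= len(self.segments):
                return None
            segment = self.segments[self.index]
            self.index += 1
            return segment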

At block 1310, if the user performs a long press on the displayed thumbnail 1260, or taps on an options disclosure icon displayed on display 1290, a menu of actions is displayed on display 1290. The user can then select options to perform from this menu.

At block 1312, tapping on items in the menu allows the user to permanently save the video highlights 804 into memory, such as flash memory 1240 on the mobile device 990, or to the cloud. The user can also send the video highlights 804 to other users, and can delete the video highlights 804. The video highlights 804 are automatically saved for a predetermined period of time, such as three days. This time period can be set/edited by the user.
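
One possible way the three-day retention behavior of block 1312 might be modeled is sketched below; the field names, default period, and purge function are illustrative assumptions rather than part of the disclosure.

    from datetime import datetime, timedelta

    DEFAULT_RETENTION = timedelta(days=3)  # user-editable per the description

    def purge_expired(highlights, retention=DEFAULT_RETENTION, now=None):
        """Keep highlights the user saved or that are still inside the retention window.

        `highlights` is a list of dicts with 'downloaded_at' (datetime) and a
        'saved' flag; the field names are illustrative only.
        """
        now = now or datetime.now()
        return [h for h in highlights
                if h["saved"] or now - h["downloaded_at"] <= retention]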

User Editing

In another example, the user can edit the exact trim points of selected video clips 800 that are determined by processor 952 to have certain characteristics, such as shown and discussed with respect to FIG. 11. Thus, rather than processor 952 automatically determining the trim points of the trimmed video clip segments 802, the server system 998 downloads the untrimmed video clips 800 that are determined by processor 952 to have the certain characteristics, as video highlights 804, to mobile device 990. This allows the user to custom create the trimmed video clip segments 802 in video highlights 804, such as for later use with music, title cards, etc.

Referring to FIG. 14, there is shown an algorithm 1400 performed by processor 1230 of mobile device 990.

At block 1402, when the mobile device 990 receives a notification as discussed with respect to block 1010 of FIG. 10 from the server system 998 that a new video highlights 804 with selected untrimmed video clips 800 having certain characteristics is available, at block 1404 the mobile device 990 automatically downloads the selected video clips 800 of video highlights 804 over the network 995 and stores them in memory, such as RAM 1242, of mobile device 990.

At block 1406, after completing the download of the selected untrimmed video clips 800 of video highlights 804, the processor 1230 responsively displays a thumbnail image 1260 indicative of the downloaded untrimmed video clips 800 comprising video highlights 804 on touch screen display 1290 with the title (e.g., the date) of the video highlights 804. These untrimmed video clips 800 of video highlights 804 are stored in memory, such as RAM 1242.

At block 1408, when the user taps on the thumbnail 1260, the untrimmed video clips 800 are displayed on display 1290 and can be individually selected by the user and trimmed as desired to create trimmed video clip segments 802 using displayed edit features. Thus, the video highlights 804 now include the user trimmed video clip segments 802.
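
As a hedged illustration of block 1408, user-chosen trim points could be clamped to the clip and converted to frame indices as follows; the function and field names are assumptions for illustration and not specified by the disclosure.

    def apply_user_trim(clip_len_s, fps, start_s, end_s):
        """Clamp user-chosen trim points to the clip and convert them to frames.

        Returns the beginning frame, end frame, and duration of the user trimmed
        video clip segment; all names are illustrative.
        """
        start = max(0.0, min(start_s, clip_len_s))
        end = max(start, min(end_s, clip_len_s))
        return {
            "begin_frame": int(start * fps),
            "end_frame": int(end * fps),
            "duration_s": end - start,
        }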

At block 1410, the trimmed video clip segments 802 of video highlights 804 can be selected to play in order until the last one plays. The user can pause the playback by touching the display 1290, and can resume playback by touching the display again.

At block 1412, if the user performs a long press on the displayed thumbnail 1260, or taps on an options disclosure icon displayed on display 1290, a menu of actions is displayed on display 1290. The user can then select options to perform from this menu.

At block 1414, tapping on items in the menu allows the user to permanently save the video highlights 804 into memory, such as flash memory 1240 on the mobile device 990, or to the cloud. The user can also send the video highlights 804 to other users, and can delete the video highlights 804. The video highlights 804 are automatically saved for a predetermined period of time, such as three days. This time period can be set/edited by the user.

Auto Story User Signals

A behind-the-scenes feature enhancement allows the auto-story algorithm to get smarter and more dynamic over time on a per-user basis. Signals are added for this purpose and include Highlight, View, Post, Send, Export, and Edit, as described below; a sketch of one way such signals might be tallied follows the list.

If a user manually highlights an image, this is tracked as a Highlight action.

If a user manually views an image, individually, either from an individual thumbnail or from a thumbnail in an expanded story, this is tracked as an Individual View.

If a user posts a single image to their story, this is tracked as a Post action.

If a user shares a single image with a friend, this is tracked as a Send action.

If a user exports an image, this is tracked as an Export action.

If a user adds an edit to an image, this is tracked as an Edit item.
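
As referenced above, the following sketch illustrates one way these per-user signals might be tallied and turned into a preference score that biases future auto-story selections; the numeric weights, class name, and tag scheme are illustrative assumptions, not taken from the disclosure.

    from collections import Counter

    # Illustrative weights: the disclosure names the signals but assigns no numbers.
    SIGNAL_WEIGHTS = {"Highlight": 3, "Post": 3, "Send": 2, "Export": 2, "Edit": 2, "View": 1}

    class UserSignalModel:
        """Per-user tally of auto-story signals used to bias future selections."""

        def __init__(self):
            self.counts = Counter()

        def record(self, signal, tag):
            """Record one signal (e.g., 'Send') against a content tag (e.g., 'pet')."""
            self.counts[tag] += SIGNAL_WEIGHTS.get(signal, 1)

        def preference(self, tag):
            """Relative preference for a tag in [0, 1], from accumulated signals."""
            total = sum(self.counts.values())
            return self.counts[tag] / total if total else 0.0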

It will be understood that the terms and expressions used herein have the ordinary meaning as is accorded to such terms and expressions with respect to their corresponding respective areas of inquiry and study except where specific meanings have otherwise been set forth herein. Relational terms such as first and second and the like may be used solely to distinguish one entity or action from another without necessarily requiring or implying any actual such relationship or order between such entities or actions. The terms “comprises,” “comprising,” “includes,” “including,” or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises or includes a list of elements or steps does not include only those elements or steps but may include other elements or steps not expressly listed or inherent to such process, method, article, or apparatus. An element preceded by “a” or “an” does not, without further constraints, preclude the existence of additional identical elements in the process, method, article, or apparatus that comprises the element.

Unless otherwise stated, any and all measurements, values, ratings, positions, magnitudes, sizes, and other specifications that are set forth in this specification, including in the claims that follow, are approximate, not exact. Such amounts are intended to have a reasonable range that is consistent with the functions to which they relate and with what is customary in the art to which they pertain. For example, unless expressly stated otherwise, a parameter value or the like may vary by as much as ±10% from the stated amount.

In addition, in the foregoing Detailed Description, it can be seen that various features are grouped together in various examples for the purpose of streamlining the disclosure. This method of disclosure is not to be interpreted as reflecting an intention that the claimed examples require more features than are expressly recited in each claim. Rather, as the following claims reflect, the subject matter to be protected lies in less than all features of any single disclosed example. Thus, the following claims are hereby incorporated into the Detailed Description, with each claim standing on its own as a separately claimed subject matter.

While the foregoing has described what are considered to be the best mode and other examples, it is understood that various modifications may be made therein and that the subject matter disclosed herein may be implemented in various forms and examples, and that they may be applied in numerous applications, only some of which have been described herein. It is intended by the following claims to claim any and all modifications and variations that fall within the true scope of the present concepts.

What is claimed is:
1. A server having an electronic processor, the processor configured to execute computer instructions to: receive a plurality of video clips, each comprised of video frames from a mobile device, captured over a predetermined time period; process the plurality of video clips to gather per-frame information about each video clip to identify one or more characteristics in respective video frames of the plurality of video clips; generate a set of selected video clips having the identified characteristics in the respective video frames, the one or more characteristics comprising at least one of whether a camera from which the video clips were taken was obstructed or overexposed, a stability of the video frames of the video clip, or whether the video frames contain pets, a specified object, humans, human speech, or human emotions; automatically trim the set of selected video clips at the server using the per-frame information in response to a set of rules to generate a set of trimmed video clip segments, the set of rules specifying inclusion of selected video clips that include at least one of a human, a pet, or an object in the trimmed video clip segments; determine which of the trimmed video clip segments to include in a video summary based on a number of the trimmed video clip segments, a combined length in time of the trimmed video clip segments, or a user determined variable including at least one of time of day, frequency, or length of trimmed video clip segments; when the combined length in time of the trimmed video clip segments is greater than a predetermined limit, compare the trimmed video clip segments to each other based on a similarity score of the video frames using the identified characteristics to remove similar said video clip segments having similar video frames and to generate a reduced set of trimmed video clip segments and to delete other portions of the selected video clips from the reduced set of trimmed video clip segments that do not satisfy the set of rules and that have not been permanently saved by a user of the mobile device; send the reduced set of trimmed video clip segments to the mobile device at a selected time of day; and enable the user of the mobile device to trim the reduced set of trimmed video clips to create user trimmed video clip segments.
2. The server as specified in claim 1, wherein the mobile device is eyewear and the reduced set of trimmed video clip segments has a predetermined time length.
3. The server as specified in claim 1, wherein the processor is further configured to reject at least one of the selected video clips from inclusion in the set of trimmed video clip segments based on an image feature of the rejected video clip.
4. The server as specified in claim 3, wherein the image feature of the at least one of the rejected video clips is indicative that a camera that captured the rejected video clip was partially or totally obstructed or overexposed during capture.
5. The server as specified in claim 1, wherein the processor is further configured to enable the user to selectively establish a beginning frame of the user trimmed video clip segments.
6. The server as specified in claim 5, wherein the processor is configured to use computer vision (CV) to identify the one or more characteristics in the video frames of the plurality of video clips.
7. A method of use of a server having an electronic processor and computer instructions, the processor: receiving a plurality of video clips, each comprised of video frames from a mobile device, captured over a predetermined time period; processing the plurality of video clips to gather per-frame information about each video clip to identify one or more characteristics in respective video frames of the plurality of video clips; generating a set of selected video clips having the identified characteristics in the respective video frames, the one or more characteristics comprising at least one of whether a camera from which the video clips were taken was obstructed or overexposed, a stability of the video frames of the video clip, or whether the video frames contain pets, a specified object, humans, human speech, or human emotions; automatically trimming the set of selected video clips at the server using the per-frame information in response to a set of rules to generate a set of trimmed video clip segments, the set of rules specifying inclusion of selected video clips that include at least one of a human, a pet, or an object in the trimmed video clip segments; determining which of the trimmed video clip segments to include in a video summary based on a number of the trimmed video clip segments, a combined length in time of the trimmed video clip segments, or a user determined variable including at least one of time of day, frequency, or length of trimmed video clip segments; when the combined length in time of the trimmed video clip segments is greater than a predetermined limit, the server comparing the trimmed video clip segments to each other based on a similarity score of the video frames using the identified characteristics to remove similar said video clip segments having similar video frames and to generate a reduced set of trimmed video clip segments and to delete other portions of the selected video clips from the reduced set of trimmed video clip segments that do not satisfy the set of rules and that have not been permanently saved by a user of the mobile device; sending the reduced set of trimmed video clip segments to the mobile device at a selected time of day; and enabling the user of the mobile device to trim the reduced set of trimmed video clips to create user trimmed video clip segments.
8. The method of claim 7, wherein the mobile device is eyewear and the reduced set of trimmed video clip segments has a predetermined time length.
9. The method as specified in claim 7, further comprising the processor rejecting at least one of the selected video clips from inclusion in the set of trimmed video clip segments based on an image feature of the rejected video clip.
10. The method as specified in claim 9, wherein the image feature of the at least one of the rejected video clips is indicative that a camera that captured the rejected video clip was partially or totally obstructed or overexposed during capture.
11. The method as specified in claim 7, further comprising the user selectively establishing a beginning frame of the user trimmed video clip segments.
12. The method as specified in claim 11, further comprising the processor using computer vision (CV) to identify the one or more characteristics in the video frames of the plurality of video clips.
13. A non-transitory computer-readable medium storing program code which, when executed, is operative to cause a computing device to perform the steps of: receiving a plurality of video clips, each comprised of video frames from a mobile device, captured over a predetermined time period; processing the plurality of video clips to gather per-frame information about each video clip to identify one or more characteristics in respective video frames of the plurality of video clips; generating a set of selected video clips having the identified characteristics in the respective video frames, the one or more characteristics comprising at least one of whether a camera from which the video clips were taken was obstructed or overexposed, a stability of the video frames of the video clip, or whether the video frames contain pets, a specified object, humans, human speech, or human emotions; automatically trimming the set of selected video clips at a server using the per-frame information in response to a set of rules to generate a set of trimmed video clip segments, the set of rules specifying inclusion of selected video clips that include at least one of a human, a pet, or an object in the trimmed video clip segments; determining which of the trimmed video clip segments to include in a video summary based on a number of the trimmed video clip segments, a combined length in time of the trimmed video clip segments, or a user determined variable including at least one of time of day, frequency, or length of trimmed video clip segments; when the combined length in time of the trimmed video clip segments is greater than a predetermined limit, comparing the trimmed video clip segments to each other based on a similarity score of the video frames using the identified characteristics to remove similar said video clip segments having similar video frames and to generate a reduced set of trimmed video clip segments and to delete other portions of the selected video clips from the reduced set of trimmed video clip segments that do not satisfy the set of rules and that have not been permanently saved by a user of the mobile device; sending the reduced set of trimmed video clip segments to the mobile device at a selected time of day; and enabling the user of the mobile device to trim the reduced set of trimmed video clips to create trimmed video clip segments.
14. The non-transitory computer readable medium as specified in claim 13, wherein the mobile device is eyewear and the reduced set of trimmed video clip segments has a predetermined time length.
15. The non-transitory computer readable medium as specified in claim 13, wherein the code is operative to enable the computing device to reject at least one of the selected video clips from inclusion in the set of trimmed video clip segments based on an image feature of the rejected video clip.
16. The non-transitory computer readable medium as specified in claim 15, wherein the image feature of the at least one of the rejected video clips is indicative that a camera that captured the rejected video clip was partially or totally obstructed or overexposed during capture.
17. The non-transitory computer readable medium as specified in claim 13, wherein the code is operative to enable the user to selectively establish a beginning frame of the user trimmed video clip segments.
18. The non-transitory computer readable medium as specified in claim 17, further including instructions for using computer vision (CV) to identify the one or more characteristics in the video frames of the plurality of video clips.